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SEQUENCE LISTI 



(1) 



INERAL INFORMATION: 
(i\ APPLICANT: Reed, John 
(ii) \lTLE OF INVENTION: Regulation of bcl-2 Gene Expression 



(iii) 
(iv) 



[BER OF SEQUENCES: 2 3 



CO] 
(A) 
(B) 
(C) 
(D) 
(E) 
(F) 



(v)' 



;SPONDENCE ADDRESS: 
>D DRESS EE: Gray, Cary, Ames & Frye 
' JREET: 401 B Street, Suite 1700 
CI:TY: San Diego 
STATE: CA 
COUNTRY : USA 
ZIP: \2101-4297 



COMPUTER READABLE FORM: 

(A) MEDIUM T^PE: Diskette 3.5 inch, 1.44 Mb storage 

(B) COMPUTER-VlBM PC compatible 

(C) OPERATING >SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 



Version =1.25 



(vi) 



CURRENT APPLICATI 

(A) APPLICATION 

(B) FILING DATE: 

( C) CLASSIFICATION : 



DATA: 
1BER: US 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Brotman, Harris F. 

(B) REGISTRATION NUMBER: 3 5461 

(C) REFERENCE/DOCKET NUMBER: P0041US0 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 699-3^30 

(B) TELEFAX: (619) 236-104 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 



CCCTTCCTAC CGCGTGCGkC 



20 



\ 



1 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CTTTTCCTCT GGGAAGGATG GCGCACGCTG GGAGA 
(2) INFORMATION FOR SZQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCTCCGACCC ATCCACGTAG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ACGGGGTACG GAGGCTGGGT AGGTGCATCT GGT 



) 




6 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
GTTGACGTCC TACGGAAACA 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
CCCCCAACTG CAGGATGCCT TTGTGGAACT GTACGG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
GGGAAGGATG GCGCACGCTG 
(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CGCGTGCGAC CCTCTTG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TACCGCGTGC GACCCTC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TCCTACCGCG TGCGACC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
CCTTCCTACC GCGTGCG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GACCCTTCCT ACCGCGT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:i: 
GGAGACCCTT CCTACCG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GCGGCGGCAG CGCGG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGGCGGGGCG ACGGA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CGGGAGCGCG GCGGGC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCTCCCAGCG TGCGCCAT ' 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TGCACTCACG CTCGGCCH 



\ 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 








GCGCCCGCCC 


CTCCGCGCCG 


CCTGCCCGCC 


CGCCCGCCGC 


v., X W W WW V V* Wj 


GCCGCTCTCC 


60 


GTGGCCCCGC 


CGCGCTGCCG 


CCGCCGCCGC 


TGCCAGCGAA 


Kj\J X w V— www WW 


CTCCGGGCCC 


120 


TCCCTGCCGG 


CGGCCGTCAG 


CGCTCGGAGC 


GAACTGCGCG 


A PGGG AGGTC 

AwwwwXlwwX w 


CGGGAGGCGA 


180 


CCGTAGTCGC 


GCCGCCGCGC 


AGGACCAGGA 


GGAGGAGAAA 




CCCGGAGGCG 


240 


GGGTGCGCCG 


GTGGGGTGCA 


GCGGAAGAGG 


GGGTCCAGGG 


GG G A G AA CTT 


CGT^GCAGTC 


300 


ATCCTTTTTA 


GGAAAAGAGG 


GAAAAAATAA 


AACCCTCCCC 


wAw w/\w w i. ww 


TTCTCCCCAC 


360 


CCCTCGCCGC 


ACCACACACA 


GCGCGGGCTT 


CTAGCGCTCG 


GG^GGGGCGG 


GCCAGGCGCG 


420 


TCCTGCCTTC 


ATTTATCCAG 


CAGCTTTTCG 


G AAAAT G CAT 


TT'GGTGTTCG 

X X ww X w X X ww 


GAGTTTAATC 


480 


AGAAGACGAT 


TCCTGCCTCC 


GTCCCCGGCT 


CCTTCATCGT 


w w w A X w X w w w 


CTGTCTCTCT 


540 


CCTGGGGAGG 


CGTGAAGCGG 


TCCCGTGGAT 


AGAGATTCAT 


UwwXwXwX ww 


GCGCGTGTGT 


600 


GCGCGCGTAT 


AAATTGCCGA 


GAAGGGGAAA 


ACATCACAGG 


Aw.*. X wXwww« 


ATACCGGACT 


660 


GAAAATTGTA 


ATTCATCTGC 


CGCCGCCGCT 


GCCAAAAAAA 


21 ^ GTGG AGCT 

AXlw X w wrtw w X 


CTTGAGATCT 


720 


CCGGTTGGGA 




X X \jr\ wrt X X X w 


TGTGAAGCAG 


1 ^GTPTGGGA 
rvriw X w X vjvjun 


ATCGATCTGG 


780 


AAATCCTCCT 


AATTTTTACT 


CCCTCTCCCC 


CCGACTCCTG 


ATTCATTGGG 


AAGTTTCAAA 


84 0 


TCAGCTATAA 


CTGGAGAGTG 


CTGAAGATTG 


ATGGGATCGT 


TGCCTTATGC 


ATTTGTTTTG 


900 


GTTTTACAAA 


AAGGAAACTT 


GACAGAGGAT 


CATGCTGTAC 


TTAAAAAATA 


CAAGTAAGTC 


960 


TCGCACAGGA 


AATTGGTTTA 


ATGTAACTTT 


CAATGGAAAC 


CTTTGAGATT 


TTTTACTTAA 


1020 


AGTGCATTCG 


AGTAAATTTA 


ATTTCCAGGC 


AGCTTAATAC 


ATTGTTTTTA 


GCCGTGTTAC 


1080 


TTGTAGTGTG 


TATGCCCTGC 


TTTCACTCAG 


TGTGTACAGG 


GAAACGCACC 


TGATTTTTTA 


1140 


CTTATTAGTT 


TGTTTTTTCT 


TTAACCTTTC 


AGCATCACAG 


AGGAAGTAGA 


CTGATATTAA 


1200 


CAATACTTAC 


TAATAATAAC 


GTGCCTCATG 


AAATAAAGAT 


CCGAAAGGAA 


TTGGAATAAA 


1260 


AATTTCCTGC 


GTCTCATGCC 


AAGAGGGAAA 


CACCAGAATC 


AAGTGTTCCG 


CGTGATTGAA 


1320 



s 



GACACCCCCT 


CGTCCAAGAA 


TGCAAAGCAC 


ATCCAATAAA 


ATAGCTGGAT 


TATAACTCCT 


1380 


CTTCTTTCTC 


TGGGGGCCGT 


GGGGTGGGAG 


CTGGGGCGAG 


AGGTGCCGTT 


GGCCCCCGTT 


1440 


GCTTTTCCTC 


TGGGAAGGAT 


GGCGCACGCT 


GGGAGAACGG 


GGTACGACAA 


CCGGG AG AT A 


1500 


GTGATGAAGT 


ACATCCATTA 


TAAGCTGTCG 


CAGAGGGGCT 


ACGAGTGGGA 


TGCGGGAGAT 


1560 


GTGGGCGCCG 


CGCCCCCGGG 


GGCCGCCCCC 


GCACCGGGCA 


TCTTCTCCTC 


CCAGCCCGGG 


1620 


CACACGCCCC 


ATCCAGCCGC 


ATCCCGCGAC 


CCGGTCGCCA 


GGACCTCGCC 


GCTGCAGACC 


1680 


CCGGCTGCCC 


CCGGCGCCGC 


CGCGGGGCCT 


GCGCTCAGCC 


CGGTGCCACC 


TGTGGTCCAC 


1740 


CTGGCCCTCC 


GCCAAGCCGG 


CGACGACTTC 


TCCCGCCGCT 


ACCGCGGCGA 


CTTCGCCGAG 


1800 


ATGTCCAGCC 


AGCTGCACCT 


GACGCCCTTC 


ACCGCGCGGG 


GACGCTTTGC 


CACGGTGGTG 


1860 


GAGGAGCTCT 


TCAGGGACGG 


GGTGAACTGG 


GGGAGGATTG 


TGGCCTTCTT 


TGAGTTCGGT 


1920 


GGGGTCATGT 


GTGTGGAGAG 


CGTCAACCGG 


GAGATGTCGC 


CCCTGGTGGA 


CAACATCGCC 


1980 


CTGTGGATGA 


CTGAGTACCT 


GAACCGGCAC 


CTGCACACCT 


GGATCCAGGA 


TAACGGAGGC 


2040 


TGGGATGCCT 


TTGTGGAACT 


GTACGGCCCC 


AGCATGCGGC 


CTCTGTTTGA 


TTTCTCCTGG 


2100 


CTGTCTCTGA 


AGACTCTGCT 


CAGTTTGGCC 


CTGGTGGGAG 


CTTGCATCAC 


CCTGGGTGCC 


2160 


TATCTGAGCC 


ACAAGTGAAG 


TCAACATGCC 


TGCCCCAAAC 


AAATATGCAA 


AAGGTTCACT 


2220 


AAAGCAGTAG 


AAATAATATG 


CATTGTCAGT 


GATGTACCAT 


GAAACAAAGC 


TGCAGGCTGT 


2280 


TTAAGAAAAA 


ATAACACACA 


T ATAAAC AT C 


ACACACACAG 


ACAGACACAC 


ACACACACAA 


2340 


CAATTAACAG 


TCTTCAGGCA 


AAACGTCGAA 


TCAGCTATTT 


ACTGCCAAAG 


GGAAATATCA 


2400 


TTTATTTTTT 


ACATTATTAA 


GAAAAAAGAT 


TTATTTATTT 


AAGACAGTCC 


CATCAAAACT 


2460 


CCGTCTTTGG 


AAATCCGACC 


ACTAATTG CC 


AAACACCGCT 


TCGTGTGGCT 


CCACCTGGAT 


2520 


GTTCTGTGCC 


TGTAAACATA 


GATTCGCTTT 


CCATGTTGTT 


GGCCGGATCA 


CCATCTGAAG 


2580 


AGCAGACGGA 


TGGAAAAAGG 


ACCTGATCAT 


TGGGGAAGCT 


GGCTTTCTGG 


CTGCTGGAGG 


2640 


CTGGGGAGAA 


GGTGTTCATT 


CACTTGCATT 


TCTTTGCCCT 


GGGGGCGTGA 


TATTAACAGA 


2700 


GGGAGGGTTC 


CCGTGGGGGG 


AAGTCCATGC 


CTCCCTGGCC 


TGAAGAAGAG 


ACTCTTTGCA 


2760 


TATGACTCAC 


ATGATGCATA 


CCTGGTGGGA 


GGAAAAGAGT 


TGGGAACTTC 


AGATGGACCT 


2820 


AGTACCCACT 


GAGATTTCCA 


CGCCGAAGGA 


CAGCGATGGG 


AAAAATGCCC 


TT AAAT CAT A 


2880 


GGAAAGTATT 


TTTTTAAGCT 


ACCAATTGTG 


CCGAGAAAAG 


CATTTTAGCA 


ATTTATACAA 


2940 


TATCATCCAG 


TACCTTAAAC 


CCTGATTGTG 


TATATTCATA 


TATTTTGGAT 


ACGCACCCCC 


3000 



CAACTCCCAA 


TACTGGCTCT 


GTCTGAGTAA 


GAAACAGAAT 


CCTCTGGAAC 


TTGAGGAAGT 


3060 


GAACATTTCG 


GTGACTTCCG 


ATCAGGAAGG 


CTAGAGTTAC 


CCAGAGCATC 


AGGCCGCCAC 


3120* 


AAGTGCCTGC 


TTTTAGGAGA 


CCGAAGTCCG 


CAGAACCTAC 


CTGTGTCCCA 


GCTTGGAGGC 


3180 


CTGGTCCTGG 


AACTGAGCCG 


GGCCCTCACT 


GGCCTCCTCC 


AGGGATGATC 


AACAGGGTAG 


3240 


TGTGGTCTCC 


GAATGTCTGG 


AAGCTGATGG 


ATGGAGCTCA 


GAATTCCACT 


GTCAAGAAAG 


3300 


AGCAGTAGAG 


GGGTGTGGCT 


GGGCCTGTCA 


CCCTGGGGCC 


CTCCAGGTAG 


GCCCGTTTTC 


3360 


ACGTGGAGCA 


TAGGAGCCAC 


GACCCTTCTT 


AAGACATGTA 


TCACTGTAGA 


GGGAAGGAAC 


3420 


AGAGGCCCTG 


GGCCTTCCTA 


TCAGAAGGAC 


ATGGTGAAGG 


CTGGGAACGT 


GAGGAGAGGC 


3480 


AATGGCCACG 


GCCCATTTTG 


GCTGTAGCAC 


ATGGCACGTT 


GGCTGTGTGG 


CCTTGGCCAC 


3540 


CTGTGAGTTT 


AAAGCAAGGC 


TTTAAATGAC 


TTTGGAGAGG 


GTCACAAATC 


CTAAAAGAAG 


3600 


CATTGAAGTG 


AGGTGTCATG 


GATTAATTGA 


CCCCTGTCTA 


TGGAATTACA 


TGTAAAACAT 


3660 


TATCTTGTCA 


CTGTAGTTTG 


GTTTTATTTG 


AAAACCTGAC 


AAAAAAAAAG 


TTCCAGGTGT 


3720 


GGAATATGGG 


GGTTATCTGT 


ACATCCTGGG 


GCATTAAAAA 


AAAATCAATG 


GTGGGGAACT 


3780 


ATAAAGAAGT 


AACAAAAGAA 


GTGACATCTT 


CAGCAAATAA 


ACTAGGAAAT 


TTTTTTTT CT 


3840 


TCCAGTTTAG 


AATCAGCCTT 


GAAACATTGA 


TGGAATAACT 


CTGTGGCATT 


ATTGCATTAT 


3900 


ATACCATTTA 


TCTGTATTAA 


CTTTGGAATG 


TACTCTGTTC 


AATGTTTAAT 


GCTGTGGTTG 


3960 


ATATTTCGAA 


AGCTGCTTTA 


AAAAAATACA 


TGCATCTCAG 


CGTTTTTTTG 


TTTTTAATTG 


4020 


TATTTAGTTA 


TGGCCTATAC 


ACTATTTGTG 


AGCAAAGGTG 


ATCGTTTTCT 


GTTTGAGATT 


4080 


TTTATCTCTT 


GATTCTTCAA 


AAGCATTCTG 


AGAAGGTGAG 


ATAAGCCCTG 


AGTCTCAGCT 


4140 


ACCTAAGAAA 


AACCTGGATG 


TCACTGGCCA 


CTGAGGAGCT 


TTGTTTCAAC 


CAAGTCATGT 


4200 


GCATTTCCAC 


GTCAACAGAA 


TTGTTTATTG 


TGACAGTTAT 


ATCTGTTGTC 


CCTTTGACCT 


4260 


TGTTTCTTGA 


AGGTTTCCTC 


GTCCCTGGGC 


AATTCCGCAT 


TTAATTCATG 


GTATTCAGGA 


4320 


TTACATGCAT 


GTTTGGTTAA 


ACCCATGAGA 


TTCATTCAGT 


TAAAAATCCA 


GATGGCGAAT 


4380 


GACCAGCAGA 


TTCAAATCTA 


TGGTGGTTTG 


ACCTTTAGAG 


AGTTGCTTTA 


CGTGGCCTGT 


4440 


TTCAACACAG 


ACCCACCCAG 


AGCCCTCCTG 


CCCTCCTTCC 


GCGGGGGCTT 


TCTCATGGCT 


4500 


GTCCTTCAGG 


GTCTTCCTGA 


AATGCAGTGG 


TCGTTACGCT 


CCACCAAGAA 


AGCAGGAAAC 


4560 


CTGTGGTATG 


AAGCCAGACC 


TCCCCGGCGG 


GCCTCAGGGA 


ACAGAATGAT 


CAGACCTTTG 


4620 


AATGATTCTA 


ATI TTTAA GC 


AAAATATTAT 


TTTATGAAAG 


GTTTACATTG 


TCAAAGTGAT 


4680 




GAATATGGAA 


TATCCAATCC 


TGTGCTGCTA 


TCCTGCCAAA 


ATCATTTTAA 


TGGAGTCAGT 


4740 


TTGCAGTATG 


CTCCACGTGG 


TAAGATCCTC 


CAAGCTGCTT 


TAGAAGTAAC 


AATGAAGAAC 


4800 


GTGGACGTTT 


TTAATATAAA 


GCCTGTTTTG 


TCTTTTGTTG 


TTGTTCAAAC 


GGGATTCACA 


4860 


GAGTATTTGA 


AAAATGTATA 


TATATTAAGA 


GGTCACGGGG 


GCTAATTGCT 


AGCTGGCTGC 


4920 


CTTTTGCTGT 


GGGGTTTTGT 


TACCTGGTTT 


TAATAACAGT 


AAATGTGCCC 


AGCCTCTTGG 


4980 


CCCCAGAACT 


GTACAGTATT 


GTGGCTGCAC 


TTGCTCTAAG 


AGTAGTTGAT 


GTTGCATTTT 


5040 


CCTTATTGTT 


AAAAACATGT 


TAGAAGCAAT 


GAATGTATAT 


AAAAGC 




5086 



# 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

fix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

S S SS S SS SS - SS SS 2j SS 5 SS S S3 S2 

5 10 



1 



AAG TAG ATC CAT TAT AAG CTG TCG CAG AGG GGC TAC GAG TGG GAT GCG 
Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr giu irp y 

20 25 



S3 2? S3 SS S S £ SS SS S S SS S SS SS SS 



35 40 



SS S E SS SS SS SS £ SS SS SS SS S S 25 25 



50 55 



SS SS 25 Arg SS S SS - S SS !2 S SS SS SS 
SS S SS SS S SS SS SS S3 SS SS S 55 SS SS SS 

85 90 



CTC CGC CAA GCC GGC GAC GAC TTC TCC CGC CGC TAC CGC GGC GAC TTC 
Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg y 

100 105 

GCC GAG ATG TCC MC CAG CTC CAC CTG ACG CCC TTC ACC GCG CGG GGA 

Ala Glu Met Ser Ser Gin Leu His Leu inr rro r 
115 120 

CGC TTT GCC ACG GTG GTG GAG GAG CTC TTC AGG GAC GGG GTG AAC TGG 
Arg Se Ala ?nr Val Val Glu Glu Leu Phe Arg Asp Gly val Asn .rp 

130 135 



48 



96 



144 



192 



240 



288 



336 



384 



432 
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GGG AGG ATT GTG GCC TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 480 

Gly Arg He Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 
145 150 155 160 

AGC GTC AAC CGG GAG ATG TCG CCC CTG GTG GAC AAC ATC GCC CTG TGG 528 
Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn He Ala Leu Trp 

165 170 175 

ATG ACT GAG TAC CTG AAC CGG CAC CTG CAC ACC TGG ATC CAG GAT AAC 57 6 

Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp He Gin Asp Asn 

180 185 190 

GGA GGC TGG GAT GCC TTT GTG GAA CTG TAC GGC CCC AGC ATG CGG CCT 624 
Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Met Arg Pro 
195 200. 205 

CTG TTT GAT TTC TCC TGG CTG TCT CTG AAG ACT CTG CTC AGT TTG GCC 672 
Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Ser Leu Ala 
210 215 220 

CTG GTG GGA GCT TGC ATC ACC CTG GGT GCC TAT CTG AGC CAC AAG 717 
Leu Val Gly Ala Cys He Thr Leu Gly Ala Tyr Leu Ser His Lys 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu lie Val Met 
1 5 10 15 

Lys Tyr lie His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 

20 25 30 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly lie 

35 40 45 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
65 70 75 80 

Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 

Leu Arg Gin Ala Gly Aso Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 

100 105 HO 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr' Ala Arg Gly 
115 120 125 

Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
130 135 140 

Gly Arg lie Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 
145 150 155 160 

Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn lie Ala Leu Trp 

165 1"0 175 

Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp He Gin Asp Asn 

180 185 190 

Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Met Arg Pro 
195 200 205 

Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Ser Leu Ala 
210 215 220 



Leu Val Gly Ala Cys lie Thr Leu Gly Ala Tyr Leu Ser His Lys 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATG GCG CAC GCT GGG AGA ACG GGG TAC GAC AAC CGG GAG ATA GTG ATG 
Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 
15 10 15 

AAG TAC ATC CAT TAT AAG CTG TCG CAG AGG GGC TAC GAG TGG GAT GCG 
Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 

20 25 30 

GGA GAT GTG GGC GCC GCG CCC CCG GGG GCC GCC CCC GCA CCG GGC ATC 
Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 
35 40 45 

TTC TCC TCC CAG CCC GGG CAC ACG CCC CAT CCA GCC GCA TCC CGC GAC 
Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

CCG GTC GCC AGG ACC TCG CCG CTG CAG ACC CCG GCT GCC CCC GGC GCC 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 

65 70 75 80 

GCC GCG GGG CCT GCG CTC AGC CCG GTG CCA CCT GTG GTC CAC CTG GCC 
Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 




CTC CGC CAA GCC GGC GAC GAC TTC TCC CGC CGC TAC CGC GGC GAC TTC 

Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 

100 105 110 

GCC GAG ATG TCC AGC CAG CTG CAC CTG ACG CCC TTC ACC GCG CGG GGA 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 

115 120 125 

CGC TTT GCC ACG GTG GTG GAG GAG CTC TTC AGG GAC GGG GTG AAC TGG 

Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 

130 135 140 

GGG AGG ATT GTG GCC TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 

Gly Arg lie Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 

145 150 155 160 

AGC GTC AAC CGG GAG ATG TCG CCC CTG GTG GAC AAC ATC GCC CTG TGG 

Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn lie Ala Leu Trp 

165 170 1~5 

ATG ACT GAG TAC CTG AAC CGG CAC CTG CAC ACC TGG ATC CAG GAT AAC 

Met Thr Glu Tyr Leu Asn Ars His Leu His Thr Trp lie Gin Asp Asn 

180 " 135 190 

GGA GGC TGG GTA GGT GCA TCT GGT GAT GTG AGT CTG GGC 

Gly Gly Trp Val Gly Ala Ser Gly Asp Val Ser Leu Gly 

195 200 205 



336 



384 



432 



480 



528 



57 5 



615 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 05 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 
15 10 1= 

Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 

20 25 30 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 
35 40 45 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
65 70 75 80 



Ala Ala Gly Pro 



Leu Arg Gin Ala 

100 

Ala Glu Met Ser 
115 

Arg Phe Ala Thr 

130 

Gly Arg lie Val 
145 

Ser Val Asn Arg 



Met Thr Glu Tyr 

180 

Gly Gly Trp Val 
195 



Ala Leu Ser Pro 
85 

Gly Asp Asp Phe 



Ser Gin Leu His 

120 

Val Val Glu Glu 

. 135 

Ala Phe Phe Glu 
150 

Glu Met Ser Pro 
165 

Leu Asn Arg His 



Gly Ala Ser Gly 

200 



Val Pro Pro Val 
90 

Ser Arg Arg Tyr 
105 

Leu Thr Pro Phe 



Leu Phe Arg Asp 

140 

Phe Gly Gly Val 
155 

Leu Val Asp Asn 
170 

Leu His Thr Trp 
185 

Asp Val Ser Leu 



Val His Leu Ala 
95 

Arg Gly Asp Phe 
110 

Thr Ala Arg Gly 
125 

Gly Val Asn Trp 



Met Cys Val Glu 

160 

lie Ala Leu Trp 
175 

lie Gin Asp Asn 
190 

Gly 

205 



